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Abstract 

General relativity has a Hamiltonian formulation, which formally provides a canonical (Liouville) 
measure on the space of solutions. In ordinary statistical physics, the Liouville measure is used 
to compute probabilities of macrostates, and it would seem natural to use the similar measure 
arising in general relativity to compute probabilities in cosmology, such as the probability that 
the universe underwent an era of inflation. Indeed, a number of authors have used the restric- 
tion of this measure to the space of homogeneous and isotropic universes with scalar held matter 
(minisuperspace) — namely, the Gibbons-Hawking-Stewart measure — to make arguments about the 
likelihood of inflation. We argue here that there are at least four major difficulties with using the 
measure of general relativity to make probability arguments in cosmology: (1) Equilibration does 
not occur on cosmological length scales. (2) Even in the minisuperspace case, the measure of phase 
space is infinite and the computation of probabilities depends very strongly on how the infinity is 
regulated. (3) The inhomogeneous degrees of freedom must be taken into account (we illustrate 
how) even if one is interested only in universes that are very nearly homogeneous. The measure 
depends upon how the infinite number of degrees of freedom are truncated, and how one defines 
"nearly homogeneous." (4) In a universe where the second law of thermodynamics holds, one 
cannot make use of our knowledge of the present state of the universe to "retrodict" the likelihood 
of past conditions. 
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I. INTRODUCTION 



Some of the most fundamental issues in cosmology concern the state of the universe at 
its earliest moments, for which we have very little direct observational evidence. In this 
situation, it is natural to attempt to make probabilistic arguments to assess the plausibility 
of various possible scenarios. For example, one could try to argue that it is "highly im- 
probable" that the universe could have been in a homogeneous, isotropic, and very nearly 
spatially fiat initial state, so that a mechanism like inflation occurring in the very early 
universe is necessary to explain the observed homogeneity, isotropy, and long age of the 
present universe. Similarly, one could try to make arguments as to whether the occurrence 
of inflation itself in the early universe was "probable." Clearly, in order to make any proba- 
bilistic arguments, one needs a measure on the state space of the universe. General relativity 
has a Hamiltonian formulation, so — formally, at least — it gives rise to a canonical measure 
on phase space. When restricted to minisuperspace, this canonical measure — known as the 
Gibbons-Hawking-Stewart (GHS) measure — has been used to make probabilistic arguments 
in cosmology, such as whether inflation was probable P-H]. The main purpose of this pa- 
per is to provide critical examination of the use of these types of probability arguments in 
cosmology. 

As we shall discuss in detail in section [IT], the phase space of general relativity has many 
similarities to finite dimensional phase spaces of classical particle systems, most importantly 
the existence of a symplectic form which is conserved under time evolution [5J. However, 
there are also significant differences. General relativity is a field theory, so its phase space 
is infinite dimensional. In the finite dimensional case, one can get the conserved canonical 
(Liouville) measure by taking the top exterior product A"u; of the conserved symplectic form 
uj. But in the infinite dimensional case this does not make sense. Thus, to get a canonical 
measure for general relativity, we must truncate the degrees of freedom by imposing long 
and short wavelength cutoffs, so that we are effectively reduced to finitely many degrees of 
freedom. A long wavelength cutoff is naturally imposed by assuming that the cosmological 
models under consideration have compact Cauchy surfaces. A short wavelength cutoff is 
effectively imposed by quantum theory and/or the weak coupling of the degrees of freedom 
of interest to very high energy modes. With the imposition of these cutoffs, a canonical 
measure can be obtained from u as in the finite degrees of freedom case. 
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However, an even more significant difference between general relativity and ordinary 
Hamiltonian systems of particles concerns the presence of constraints and the nature of 
these constraints. On account of the Hamiltonian and momentum constraints, not all points 
of phase space are allowed initial data for solutions. Thus, one must restrict consideration 
to the submanifold, C, of points of phase space that satisfy the constraints. However, 
when pulled back to C, the symplectic form becomes degenerate. The degeneracy directions 
of the puUback, uj, of the symplectic form, u, to C are precisely the infinitesimal gauge 
transformations, i.e., the changes in initial data corresponding to performing infinitesimal 
spacetime diffeomorphisms on the corresponding solutions. One can remove this degeneracy 
in a natural manner by passing to the gauge equivalence classes of initial data, i.e., by 
considering a new phase space whose points correspond to physically distinct spacetime 
solutions. One can then define a canonical measure on (a truncated version of) this space. 
However this gives rise to a major, additional conceptual issue: "Gauge" (= spacetime 
diffeomorphisms) includes "time evolution," so once "gauge" has been eliminated from the 
phase space, there is no longer any notion of "time evolution." But in the absence of 
nontrivial time evolution, one cannot make any arguments concerning dynamical behavior, 
such as equilibration, the second law of thermodynamics, etc. Note that this is a classical 
version of the "problem of time" that arises in canonical quantum gravity. 

This issue can be dealt with by choosing a representative of each gauge orbit rather 
than passing to the space of gauge orbits. "Time evolution" then gets replaced by "change 
of description" when a different representative of each gauge orbit is chosen. The main 
difficulty with proceeding in this manner is that it does not appear possible to choose a 
unique gauge representative globally in phase space in a continuous manner, so this strategy 
cannot be implemented globally. Nevertheless, it is plausible that Cauchy surfaces with 
constant mean curvature K = K^, are unique in certain classes of "nearly FLRW" models 
with compact Cauchy surfaces. For the purposes of this paper, we will assume that the 
condition K = K^, picks out a unique Cauchy surface in the spacetimes under consideration 
here. In that case, "time evolution" is replaced by "change of description" of initial data 
when the choice of representative Cauchy surface is changed from from K = K^, to K = K'^. 
Thus, K effectively plays the role of "time" P, [7] , and one recovers a notion of dynamical 
evolution, although with a "time dependent" Hamiltonian. 

The above considerations bring general relativity to a framework that is close to the 
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framework of ordinary statistical physics. In ordinary statistical physics, the canonical 
measure has been successfully used to make probability arguments, and one might attempt 
to make similar arguments in cosmology. However, there are a number of serious difficulties 



in doing so. As we shall discuss in section III, one difficulty concerns the effective time 
dependence of the Hamiltonian and the insufficient time for equilibration to have occurred 
on scales comparable to the Hubble radius, thereby removing the usual justification for 
the use of the canonical measure. Another serious problem concerns the fact that even in 
minisuperspace models, the phase space of general relativity is non-compact, and the total 
measure of phase space is infinite. Thus, the "probability" of any subset of phase space is 
either (if its measure is finite), 1 (if the measure of its complement is finite), or ambiguous 



(if both the set and its complement have infinite measure). In section IV, we shall discuss 
the calculation of the "probability of infiation" and — in agreement with a much earlier 
discussion by Hawking and Page [2] — explain why different authors have obtained widely 
different results using ostensibly the same (conserved!) GHS measure on minisuperspace. 
In section |V| we consider inhomogeneous degrees of freedom. We briefly discuss some of 
the additional difficulties that arise when considering the inhomogeneous degrees of freedom 
in cosmology, such as placing a "short wavelength cutoff" in an expanding universe. Most 
of section |V] is devoted to deriving the modifications of the GHS measure that take the 



effects of inhomogeneous perturbations into account. Finally, in section VI we discuss the 
fact that in a universe with a thermodynamic arrow of time, one cannot "retrodict" the 
past by knowing the present state of the universe and determining which histories are most 
"probable" in the canonical measure; the second law of thermodynamics guarantees that 
the actual history of a system is extremely rare in the space of all possible histories that are 
consistent with present course-grained observations. 

We conclude this introduction by mentioning an issue that we will not address in this pa- 
per. The fact that we are conscious observers undoubtedly places a bias on our observations: 
We are only able to directly observe portions of spacetime that can admit the presence of 
conscious observers. Therefore, in order to obtain the probability that a region of spacetime 
will be observed to have a certain property, we should multiply the a priori probability of 
the existence of such a region of spacetime by the probability that conscious observers exist 
in this region. Unfortunately, there are few things we know less about in science than the 
conditions needed for the existence of conscious observers (or, indeed, even what "conscious 
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observers" are). In many discussions, it is implicitly or explicitly assumed that various con- 
ditions we observe to occur in our universe are necessary for conscious observers to exist. It 
therefore should not be surprising if the conclusion then drawn is that the universe should 
look similar to what we observe it to be, even if the a priori probability of what we observe 
(without taking into account the bias of our existence) is extremely small. Thus, although 
such anthropic arguments can be used to attempt to salvage a theory that predicts a very 
small a priori probability for what we observe, we do not feel that any reliable anthropic 
types of arguments can be made until we have a much better understanding of the likelihood 
that conscious observers could exist in conditions very different from those of our own uni- 
verse. Thus, we shall not consider anthropic arguments in this paper, nor shall we consider 
any issues that involve the likelihood of the existence of observers and/or civilizations with 
given characteristics [8HTT]. including the "measure problem" as discussed in, e.g., [T2| [13]. 

II. THE LIOUVILLE MEASURE FOR GENERAL RELATIVITY 
A. The Liouville Measure for Particle Mechanics 

We begin by briefly reviewing how the Liouville measure on phase space is constructed in 
the case of usual Hamiltonian particle mechanics. For a Hamiltonian system with n degrees 
of freedom, the phase space is 2n dimensional and can be labeled by the canonically conju- 
gate coordinates {q^,Pi)- If the Hamiltonian of the system is 'H{q^,Pi), then the dynamical 
trajectories of the system in phase space are the integral curves of the time evolution vector 
field T given by 

j.^y(dnd__dnd_\ 
^ \ dpi dq^ dq^ dpt J 

The symplectic form on phase space is given by 

n 

uj = ^dpiA dq' . (2) 

i=l 

It is obvious from its definition that du = and that dH = —T ■ u, where the "•" denotes 
the contraction of T into the first index oi u. It then follows immediately that 



Ctco = T ■du + d{T ■u) = -d^U = 



(3) 
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By taking the wedge product of u with itself n times one obtains a volume element on phase 
space known as the Liouville measure VL: 

/_i\n(n-l)/2 

n = ^ A"w = rf>rf"g. (4) 

n\ 

It follows immediately from (|3]) that phase space volumes in the Liouville measure are 
preserved under time evolution, 

CtVL = . (5) 

One may also derive the symplectic form (and hence, the Liouville measure) starting 
with a Lagrangian formulation of the theory, and we will use an analog of this procedure 
to get the symplectic form in general relativity. We consider the space of paths g(t). For 
a Lagrangian of the form its first variation under a variation of path from q{t) to 

q{t) + 5q{t) is given by 



6L = Y^EM + J^{^P^Sq'^ 



(6) 



where 

F^ — --— (7) 
' dq' dtdct ^ ' 

are the Euler-Lagrange equations of motion and 
We write 

d = Y.P^^^' (9) 

i 

and define uj[5iq, 52q\ to be the antisymmetrized second variation of 6*, 

^[Siq, 52q] = Si {e[S2q]) - S2 . (10) 

Here Siq{t) and 62qit) denote two independent variations of the unperturbed path q{t). From 
(|9|, we obtain 

u;[6,q,52q] = J](5ip.)(52gO -(1^2). (11) 

i 

Thus, at time t, u depends on the path variation only via the variation of and Pi at time 
t. If we define phase space to be the space of all {q\pi), then co is non-degenerate on this 
space. The quantity uj[6iq,62q] at time t corresponds to the symplectic form appearing in 
(|2| contracted into the tangent vectors in phase space corresponding to the path variations 
{Siq, 62q) at time t. 
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B. The Phase Space of General Relativity 

We now consider the phase space of general relativity. There are two major differences in 
the construction of phase space and the symplectic form on phase space as compared with 
the particle mechanics case: (i) There are constraints in phase space, (ii) The phase space 
of general relativity is infinite dimensional. 

We will restrict consideration to globally hyperbolic spacetimes with compact Cauchy 
surfaces, so that integrals over Cauchy surfaces will automatically converge. We will ex- 
plicitly consider the case where the matter content is a minimally coupled scalar field in a 
potential — since our primary application will be to address probability issues in inflation- 
ary cosmology — but the discussion would be qualitatively the same for other matter fields 
provided only that the complete theory can be derived from a diffeomorphism covariant 
Lagrangian. The dynamical fields are then the spacetime metric gab and the scalar field (p, 
and we denote them together as x- 



The Lagrangian 4-form is (using SttG = m ? = 1 and the metric signature (— , +, +, +)) 



where R is the scalar curvature of gab, and eabcd is the volume form associated with gab- The 
scalar field's stress-energy tensor is 



We follow the prescription of [5] (see also [E]), to construct the symplectic form on the 
phase space of this Lagrangian field theory. When we vary the fields, 6x = {Sgab,Sip), the 
resulting variation of the Lagrangian is 



X = {gab, ■ 



(12) 




(13) 




(14) 



5^abcd[x;5x] = (E'^Sgef + E^dif) 



eabcd + {dO) 



abed 1 



(15) 



where 



E' 



tab 



(C^ yife^ 



(16) 



and 




(17) 



so that -E""* = and E^p = are the field equations for x- The symplectic potential 3-form 



appearing in ( 15 ) is given by 



Obcd [x; ^X] = 2 ( "^eSg"' - W'dg - 2 ( W) 5^ ) e^^cd 



(18) 



where all indices are raised by g""^ . In parallel with (10), for any two field variations 5ix and 
^2X5 we define 

Wbcd [x; ^ix, 52x] = ^1 {Gbcd [x; ^2x]) - ^2 [x; (^ix]) • (19) 



It follows from taking an antisymmetrized second variation of (15) that if 5ix and 52X are 
solutions of the linearized field equations, then 



abed 



0. 



(20) 



Performing the variations on (18), we obtain 
1 f 1 



1 



u^bcd = 2 <j ^iSigni'^eS2g) - {5,g,f){y'52g''f) + -(5l^7e/)(V'^52/0 
\{5,g){y''52g) + \{5,g){y e52gn + 2{ye^){5,gnM 
(VV)(5i^7)(52</^) - 2{V''5i^){52v) \eabcd -(1^2). 



(21) 



We define w [x; f^iX; '^2X] by integrating Wfecd over a Cauchy surface S, 



w [x; ^ix, '^2x] = j ^bcd ■ 



(22) 



It follows immediately from (20) that if 6ix and 62X are linearized solutions, then u is 



independent of choice of S. If we work in coordinates where S is a surface of constant 
x^, then we obtain. 



\ ^ d^x^g\^-{5,g'^){V,52g) - {5ig,.){V^52g"') + \{5,g,.){V%2gn 

- \{hg)iy'52g) + \{5^g){y ,52g'n + 2{y ,^){5,g'^){52^) (23) 

- (VV)(5i^7)(52<^) - 2{V%,^){52^) I - (1 ^ 2) 



This can be rewritten as 



U) 



^d'x^-{6r'K'''){62K,) + {6,'K^){62^) 



(24) 
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where hat is the spatial metric on S induced by gab, the conjugate momentum vr"* is given 
by 

tt'^* = y/h {K"^ - Kh"^) (25) 
(where K'^^ is the extrinsic curvature of S), and vr^ is given by 



^ 5(Vo^) 



-v^VV- (26) 



When written in the form (24), it is clear that at "time" S, u depends on the field variations 
5ix and only through the variations of hab-, tt"'^, ^p, and ir^p on S. Thus, if we define phase 
space, V, to be the space of all fields {hab, ^i""^, f, tt^) on S — where hab is a Riemannian metric, 
n"-^ is a tensor density, ip is a scalar field, and tt^^ is a scalar density — then u is non-degenerate 
on V and defines a symplectic form. 

Up to this point, our construction of the phase space of general relativity has been in 
direct parallel to the particle mechanics case, with the only significant difference being 
that V is infinite dimensional. However, a new issue now arises from the fact that general 
relativity has constraints — i.e., not all points of phase space are dynamically accessible — and 
we are only interested in the points in phase space that satisfy the constraints. However, 
the pullback, UJ, of u to the constraint surface, C, is degenerate. In fact, the degeneracy 
directions of uJ are exactly the "pure gauge" variations: 6x is a degeneracy direction of uJ 
if and only if 6x corresponds to a field variation arising from an infinitesimal spacetime 
diffeomorphism, i.e., 6x = Ce,X for some vector field If we wish to obtain a volume 
measure on phase space from cJ, it is essential that uj be non-degenerate. We can make cJ 
non-degenerate by "factoring out" the degeneracy directions, i.e., by passing to the space, V, 
of orbits of {hab, tt"^, vTi^) under spacetime diffeomorphisms. In other words, the "physical 
phase space," V, of general relativity can be taken to be the space of gauge orbits on C, and 
UJ induces a nondegenerate symplectic form on this space. However, since "time evolution" 
is implemented by spacetime diffeomorphisms, once we have passed to the space of gauge 
orbits, there is no remaining dynamics. Each point in the physical phase space represents 
one spacetime, once and for all, and there is no "motion through phase space" as there was in 
the particle mechanics case. Thus, a form of Liouville's theorem holds for the physical phase 
space of general relativity (for any choice of measure), but it is entirely trivial: Volumes are 
preserved under time evolution, since there is no such thing as time evolution! Clearly, this 
form of Liouville's theorem will not be useful for making statistical physics arguments. 
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However, a nontrivial notion of "time evolution" can be reintroduced as follows. Instead 
of passing to the space of gauge orbits, we can instead try to choose a representative of each 
gauge orbit. Ideally, one would wish to find a surface, S, in C with the property that each 
gauge orbit in C intersects S once and only once. The space S — with symplectic form given 
by the puUback of uJ to S — could then be used to represent the physical phase space. One 
could then ask how the description of the physical phase space would change if one made a 
different gauge choice, i.e., if one used a different surface S'. This "change of description" 
would correspond to "time evolution" (i.e., change of the choice of "representative Cauchy 
surface" in spacetime for each solution) together with additional changes in description 



induced by spatial diffeomorphisms. Note that, by the statement below (22), the map 
taking (iS,cJ) to {S',u) that associates points lying on the same gauge orbit preserves the 
symplectic structure. Thus, the "change of description" map has the same basic character 
as ordinary time evolution in Hamiltonian mechanics. 

Unfortunately, however, it clearly will not be possible to globally choose a smooth surface 
iS in C with the property that each gauge orbit intersects S once and only once; to do 
so would be tantamount to giving a prescription to uniquely fix coordinates for arbitrary 
solutions. The best one could realistically hope for is to find a family of surfaces {Sa} that 
work in localized regions of C, i.e., such that each gauge orbit intersects at least one Sa 
and no gauge orbit intersects any individual Sa more than once. For spacetimes with a 
compact Cauchy surface (as are being considered here), the choice of representative time 
slice given by setting the trace, K, of the extrinsic curvature equal to a given constant, 
K^,, is an excellent candidate for defining a surface, Sa, that has the desired property for a 
wide class of trajectories in C. Specifically, if the strong energy conditions holds. Brill and 
Flaherty |T3] have proven uniqueness of a constant mean curvature foliation, and existence 
is known to hold in a wide class of models [TB], although counterexamples are also known 
[I6] : see [171 [18] for further discussion. However, we cannot directly apply any of these 
results here because the models we consider in this paper with a scalar field in a potential 
do not satisfy the strong energy condition]^ Nevertheless, for FLRW models (see the next 
subsection), existence of constant mean curvature slices is obvious — they are the time slices 
picked out by the homogeneous and isotropic symmetry — and K (= 3H, where H is the 

^ Indeed, satisfaction of the strong energy condition would preclude the possibility of inflation. 
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Hubble constant) evolves via 



H 



1 .2 , 



(27) 



Thus, H is monotonic if the spatial curvature is or —1, although it need not be monotonic 
for K = +1. It appears plausible that solutions that are sufficiently close to FLRW models 
with K = 0, — 1 can be uniquely foliated by Cauchy surfaces of constant mean curvature, K. 
Our main interest in this paper is in cosmological models that are close to k = FLRW 
models, and we will assume that in the region of C under consideration here, the equation 
K = K^: (where K^, is any positive constant) uniquely selects a Cauchy surface for each 
solution. For simplicity, we will also assum^that a prescription has been given to uniquely 
fix spatial coordinates on the surface K = K^. 

The "time evolution" discussed above then corresponds to how the initial data {hab-, t^"'^, 
V9, vr^) changes when we go from the Cauchy surface K = K^, to the Cauchy surface K = K'^. 
Thus, K effectively plays the role of "time" (sometimes called "York time" [HI E])- This 
effective time evolution is generated by the Hamiltonian 



where C is the Hamiltonian constraint and Ca is the momentum constraint (see, e.g., |19]): 



Here the lapse, M, and the shift, A^'* must be chosen so as to preserve the gauge conditions; 
specifically, the lapse must be chosen so as to maintain the spatial constancy of K and the 
shift must be chosen to preserve whatever spatial coordinate conditions have been imposed. 
Thus, Af and A/"" will depend upon {hab, tt"^, ip, vr^), and their dependence on these variables 
will, in general, be non-local in space. Since JV and A^" as well as C and Ca depend on K^, 
the Hamiltonian evolution defined by "H is analogous to ordinary time evolution with a time 
dependent Hamiltonian. Note that = when evaluated on the constraint surface C in V, 
but its gradient in directions off of C is nonzero, so it generates non-trivial "time evolution" 
in the above sense. 



n = MC + AfC, 



(28) 




(29) 




(30) 



^ This assumption is not necessary, since instead of fixing spatial coordinates, we could work with the space 
of orbits under spatial diffeomorphisms. 
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The above considerations provide us with a notion of time evolution in general relativity 
on a phase space with a conserved symplectic form. However, the phase space of general 
relativity is infinite dimensional, and it does not make mathematical sense to attempt take 
an "infinite wedge product" analogous to Q to define a volume measure on phase space. 
We will deal with this difficulty by simply assuming that the effective number of degrees 
of freedom is actually finite. Since we are, in any case, considering only spacetimes with 
compact Cauchy surfaces, there are no degrees of freedom corresponding to "arbitrarily long 
wavelengths," so the assumption of effectively having only finitely many degrees of freedom 
amounts to imposing a short wavelength cutoff on modes. Such a cutoff in the degrees of 
freedom may be expected to arise at the Planck scale due to a discreteness of space or other 
fundamental differences between the true theory of nature and classical general relativity. 
However, an effective cutoff will occur in any case well before the Planck scale due to the fact 
that modes of the metric and scalar field that are of sufficiently "high energy" will not get 
excitec|^ and can thereby be ignored. If we consider only finitely many degrees of freedom 
and neglect any coupling to the degrees of freedom that are being ignored, then we may 
define a conserved Liouville measure via (|4]), in close analogy to particle mechanics. 

In section |V] we will briefly discuss some of the difficulties arising from performing such 
a truncation of the degrees of freedom of general relativity. We will then analyze the effects 
of the (large number of) degrees of freedom that are not being truncated on the measure of 
"nearly FLRW" universes. However, in the next subsection, we will consider the measure 
obtained on phase space by the above construction when one truncates all of the degrees 
of freedom except for the homogeneous modes ( "minisuperspace" ) . This will illustrate our 
general construction in a simple, concrete case and will also enable a comparison with 
previous work [T]-H]. 

C. The Liouville Measure on Minisuperspace 

We now consider Friedmann-Lemaitre-Robertson-Walker (FLRW) metrics 

ds"^ = -dt^ + a^{t)%jdx'dx^ , (31) 

^ In classical theory, all modes share energy equally in equilibrium (equipartition) , leading to an "ultraviolet 
catastrophe" if one has infinitely many degrees of freedom. The effective cutoff occurs because this is not 
the case in quantum theory. 
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where is a time independent Riemannian metric on a 3-space of constant curvature: 

^^^Rijki = K {'jikiji - liajk) , (32) 
with K = — 1,0,+1 for hyperbohc, flat, and spherical 3-geometries, respectively. As above 



(see (13)), the matter consists of a homogeneous scalar field ({){t) in the potential V{({)). 
Inserting two perturbations towards other FLRW models 

5gab = 2a{t)Sa{t)'jab , ^ 

(33) 

into (23) yields 

w = y d='a;v^{-6a(5id)(52a) + 30a'(5ia)(52<^) + a3(5i0)(52<^)} -(1^2). (34) 
This can be rewritten as 

a; = y" rf=^xy^{(5ip,)(52a) + {8rV^){8^v)} -(1^2) (35) 



where we have defined 

Pa = — 6ad 



(36) 



Thus, the initial data of the previous subsection — consisting of the tensor fields [hah-, tt'^^, v^, vr<^) 
on S — now reduces simply to the quantities {a,Pa,(p,p^), which depend only on t. Thus, 
when reduced to minisuperspace, the infinite dimensional phase space V of the previous 
subsection becomes 4-dimensional. Note that u is well defined only when the volume of 
space, J d^x^, is finite, i.e., only for the case of a compact Cauchy surface, as we have 
been assuming. Thus, for k = 0, we consider only the case where the universe is a 3-torus; 
for K = —1, we consider only cases where the universe is compactified similarly. For the 
remainder of the paper we will assume that the spatial coordinates have been scaled so that 
J cPxy/^ = 1. This enables us to write 

^ = {{SiPa){S2a) + {6,p^){62^)} -(1^2). (37) 

Note that even in the case k = 0, a has a well defined physical meaning: is the total 
volume of space. Comparing (37) to ^ and (11), we see that the symplectic form on 



minisuperspace is given by 

u = dpa Ada + dp^ A d(p , (38) 
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in agreemenl]^ with Gibbons, Hawking, and Stewart (GHS) [T]. 

The next step in deriving the measure on minisuperspace is to pull back u to the constraint 
surface to get u. In the homogeneous case, there is only one constraint, the Hamiltonian 
constraint, usually written in the form 



3 



where H = d/a is the Hubble 'constant'. Substituting from (36), we obtain 



2 2 

+ 6Ka - 2a^V{^) = . (40) 



6a 

Since the constraint surface is 3-dimensional (and 3 is an odd number), the pullback, u, 
of the symplectic form to the constraint surface is automatically degenerate. However, in 
accord with the ideas of the previous subsection, we will obtain a non-degenerate symplectic 
form if we pull u back to a surface S that intersects each "gauge orbit" (i.e., dynamical 
trajectory) exactly once. We choose S to be the surface H = H^,; this is guaranteed to work 



for K, = 0,-1, but need not define a unique slice for k = +1, as noted above (see (27)). 
Since S is 2-dimensional, the Liouville measure on S is just the pullback of u to S, 

Qcim = (^\s- (41) 

In the case of a massive scalar field 

Vi^) = ^-mV , (42) 
we can write the constraint in the form 

02 + - SKa^ = 6H^ , (43) 

where we have defined 

a = l/a. (44) 

The surface S is then the 2-dimensional surface in the space (mip, Lp, \/6a) defined by setting 
H = m. (43). For k = —1 this surface is a hemisphere (since a > 0) of radius y^-f^*, for 



Gibbons, Hawking, and Stewart derived their expression for the symplectic form by starting with a 
Lagrangian for a and which gave the correct FLRW equations of motion. (That Lagrangian agrees with 



what would be obtained by plugging an FLRW metric into the Lagrangian (131, up to total derivatives 
and a factor of the coordinate volume of space.) 
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K = it is a (a > half) cylinder of radius A/6if* centered on the a axis, and for k = +1 it 
is a (a > half) one-sheet hyperboloid of minimum radius \/QH^, centered on the a axis. 
It is convenient to use the coordinates (a, ip) on S. These coordinates do not properly 



cover S, since, as can be seen from (43), each allowed value of {a,(p) corresponds to two 
points of iS, one for each of the two possible signs of ip — except for the zero measure set 
defined by = 0. The coordinates are non-singular on each of the two non-overlapping 
patches with > and < 0, but become singular where = 0. In either of these 
nonsingular coordinate patches, the puUback of the symplectic form can be written 

^\s= 4-/ rr. d^^da, 45 



where (p{a, (p, H^:) is determined by (43) (with H = H^) up to a sign depending on which 



coordinate patch the point under consideration lies in, and if (a, v?, H^) is given by (27). 

For K = 0, —1, H is always negative, and we therefore can write the GHS measure as the 
positive measure on S (switching back from a to a) 

-6a'H{a,ip,H,) ^ ( 3a' (GH^ - m'cp' + AKa-') \ 

"GHS = — TTT rrr^ — dipda = , =^ dioda , (46) 

\ip{a,ip,H,)\ ^ \ ^QHl - m2(^2 + ^^^-2 j ^ ^ ^ > 

in agreement with expression^ given in [HIH]. Liouville's theorem then says that the measure 
of any set of trajectories (universes) is independent of the choice of H^,, as can be verified 
explicitly by using the evolution equations. 



For K = +1, (46) still holds, but the numerator now can be negative in some regions. 
This is a direct manifestation of the fact that the surface S given hy H = does not 
have the property that each trajectory intersects S only once. In effect, the GHS measure 
assigns negative weight to the trajectories that cross S in the "wrong" direction ^iJ > j . 
However, we can salvage a version of Liouville's theorem in this case by considering only 
the measure of subsets U C S that include all trajectory crossings, i.e., subsets U with the 
property that if p G f/ then every intersection of the trajectory determined by p with S also 
lies in U. If we consider only trajectories that satisfy if — i- oo in the asymptotic past and 
H — )■ — oo in the asymptotic future [21 [20], then the total number of crossings of S will be 



The divergence at — ^ is simply due to the breakdown of our coordinates there. If we instead had used 
coordinates {a,(p), then we would have obtained = a^m^y '^'^ ^ ^'^^ which is well behaved at = 
(but is singular at (/? = 0). However, in the case k = we have aj|_j = at the points where ip = 0. This 
is a real effect, reflecting the fact that the trajectories passing through these points are tangent to S, as 
they have H = -^ip'^ ^ 0^ from (l27). 
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odd, and the measure of any subset U of the above form will be positive. This measure will 
be independent of the choice of H^:. 



Returning to the case of general n, it is easily seen that if we integrate (46) over the 
allowed range of ip and a, we obtain 

riaus = OO , (47) 



s 



as has been previously noted by a number of authors |2]-H]. Thus, even for this simplest 
minisuperspace case, the total volume of the physical phase space is infinite. In section 



IV, we will discuss the resulting difficulties that arise if one attempts to make probability 



arguments using a measure where the measure of the total space is infinite. 



III. DIFFICULTIES WITH EQUILIBRATION 

For ordinary systems in statistical physics — such as a box of gas — the Liouville measure 
is used to determine the probability distribution for observables of interest. More precisely, 
for a time independent system with n degrees of freedom, the Liouville measure Q = A"cu 
gives rise to a measure Qm (the "microcanonical ensemble") on the constant energy surface 
Ti = Eq hj the formula Q = Qm A dTi. This has been successfully used to compute 
probabilities when the system has energy Eq. Despite assertions sometimes made to the 
contrary, the justification for using this measure is not based upon "ignorance" — nothing 
can be derived from ignorance — but rather on the dynamical evolution properties of the 
system. For a suitably ergodic system|^ Birkhoff's theorem asserts that for almost all states, 
the fraction of time spent in a region of the constant energy surface is proportional to the 
^Af-nieasure of that region. Thus, if we let O be the region of the constant energy surface 
corresponding to a particular macroscopic property of the system, then if one examines the 
system at a "random time" during its dynamical history, the probability that the system 
will possess the given macroscopic property should be proportional to the l^Af-measure of 
O. 

Of course, the amount of time needed by the system to properly explore all portions of 
its surface of constant energy is of order the Poincare recurrence time, which is much longer 

^ The precise condition needed for the vahdity of Birkhoff's theorem is metric indecomposabihty, i.e., the 
constant energy surface cannot be written as the union of two sets of positive measure, each of which is 
taken into itself under dynamical evolution. 
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than any timescale relevant to human observations. Nevertheless, there are much shorter 
"equilibration timescales" that should yield sufficient time for the system to explore enough 
of its surface of constant energy to justify the use of the Liouville measure to calculate 
probabilities. For example, for a box of gas, there is a "collision timescale" governing the 
amount of time needed for a given particle to significantly change its momentum and a 
"diffusion timescale" governing the amount of time needed for a given particle to move 
across the box. If one waits a time much longer than either of these timescales — but much 
shorter than a Poincare recurrence time — then the system should have evolved sufficiently 
to "forget" any "special" initial conditions, and the use of the Liouville measure to calculate 
probabilities should be justified. 

From the above discussion, it is evident that use of the Liouville measure to calculate 
probabilities will not be justified in the following (non-exclusive) circumstances: (i) the 
system is not ergodic, i.e., it does not explore all of its surface of constant energy; (ii) 
one has not waited a time much greater than the equilibration time after the system was 
prepared; (iii) the system has a time dependent Hamiltonian that is varying on a timescale 
that is small or comparable to the equilibration time. 

A situation in which case (i) occurs is a system where there are additional conserved 
quantities besides energy. However, in many cases, this difficulty can be dealt with by simply 
adjoining the additional conserved quantities to energy as additional "state parameters." If 
the system is ergodic on the level surfaces of all of the state parameters, then probabihty 
arguments similar to those where the system is ergodic on the constant energy surfaces can 
be made. However, a much more problematic occurrence of case (i) arises if the system is 
such that the surfaces of constant energy are non-compact and have infinite QM-measure, 
since then the system will not be able to suitably explore all of its available phase space. An 
example of such a system is a spatially unconfined gas. Another example is a gas of point 
particles confined by a box but interacting via Newtonian gravity. If there is no restriction 
on how close the particles can approach each other, the surfaces of constant energy will have 
infinite rijvf-mcasure, since the potential energy can decrease without bound, allowing the 
kinetic energy (and therefore the momenta) to increase without bound. The system will 
dynamically evolve in such a way that groups of particles become more and more tightly 
bound and the overall system becomes hotter and hotter, but "equilibrium" will never be 
achieved and one will not be able to use statistical physics arguments to predict the details 
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of the evolution. 

In case (ii), the system will "remember" its initial state, so the probabilities for obser- 
vation of a macroscopic property corresponding to region O may depend as much or more 
on the manner of preparation as on the ^M-nieasure of O. Case (iii) is essentially a variant 
of case (ii), but in case (iii) it will not help to "wait longer," since even the energy of the 
system at a late time t will continue to depend on the initial state and the details of the 
history of how the Hamiltonian varied with time. 

From the discussion of the previous section, it is clear that general relativity manifests 
all three of the above characteristics. Even in the minisuperspace case, the physical phase 
space of general relativity is non-compact and its total measure is infinite, so it is manifestly 
impossible for any solution to suitably "explore its phase space." Thus, one is automatically 
in case (i). Furthermore, the timescale for dynamical variation of inhomogeneous degrees 
of freedom corresponding to modes of wavelength comparable to the Hubble radius is of 
order of the age of the universe, so, clearly, there is insufficient time for equilibration of such 
degrees of freedom (case (ii) above). Indeed, even for subsystems that are much smaller than 
the Hubble radius but still large enough that self-gravitation is important, the "equilibration 
time" will be much larger than the age of the universe: As remarked above, for point particles 
interacting via Newtonian gravity, the phase space has infinite volume and no equilibrium 
is ever achieved. General relativity effectively provides a cutoff to this behavior via the 
formation of black holes, so it may be possible for an isolated, confined system to achieve 
equilibrium. Nevertheless, the timescale for matter to form and/or fall into black holes (and 
for the black holes to merge) will be very large, so even if case (i) does not apply, we are 



in the situation of case (ii). Finally, as we have seen in subsection |II B the Hamiltonian of 
general relativity depends upon the "time," K^,, and thus is effectively time dependent with 
respect to the degrees of freedom whose dynamics are sensitive to the Hubble expansion. 
Thus, the very long wavelength degrees of freedom are in case (iii). 

In summary, for isolated subsystems whose size is much smaller than the Hubble radius 
and for which self-gravitation is not important, there is no reason why ergodic-like behavior 
should not occur. If such subsystems have been provided sufficient time to equilibrate, one 
should expect the usual arguments of statistical physics to apply, and the Liouville measure 
should provide probabilities for observing macrostates. However, to determine a probability 
distribution for the macroscopic state of degrees of freedom whose scale is comparable to the 
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Hubble radius, one will need to know both the probability distribution for initial conditions 
and the evolutionary history of the universe. Even for subsystems that are much smaller 
than the Hubble radius, if self-gravitation is important in the dynamics, the equilibration 
time is so long that initial conditions will not be "erased." 

Thus, we claim that the only way to justify the use of the Liouville measure in cosmology 
would be to postulate that the initial conditions of the universe were chosen at random from 
a probability distribution given by the Liouville measure. In other words, one must postulate 
a "blindfolded creator" [21] throwing darts toward a board of initial conditions for the 
universe that was built using the Liouville measure. This is a very different justification of 
the Liouville measure from that used in ordinary statistical physics, and has the status of 
an unsupported hypothesis. 

Nevertheless, for the remainder of this paper, we will accept the validity of the Liouville 
measure. In the next section, we discuss the difficulties/ambiguities in obtaining probabilities 
from the Liouville measure associated with the fact that, even in minisuperspace models, 
the total measure of physical phase space is infinite. 



IV. DIFFICULTIES DUE TO INFINITE TOTAL MEASURE; THE PROBABIL- 
ITY OF INFLATION 

We have seen in section [TT] that — after truncation to a finite number of degrees of 
freedom — a Liouville measure, Q, can be defined on the physical phase space S. Let X 
denote a physical property of interest and let Ax C S denote the region of physical phase 
space that possesses that property. Then, if it were the case that fl{S) < oo, one could 
assign a probability, P{X), to X via 

HX) ^ ^ . (48) 



However, in the actual case where = oo (see (47)), the situation divides into three cases: 
(1) If Q{Ax) < oo, then one can unambiguously assign P{X) = 0. (2) If Q{S \ Ax) < oo, 
then one can unambiguously assign P{X) = 1. (3) If both Q{Ax) = oo and Q{S \ Ax) = 
oo, then P{X) is ambiguous and cannot be assigned a value without specifying additional 
information, such as a "regularization procedure." Here, by a "regularization procedure" we 
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mean a nested sequenc^of subsets {iS„} with U„iS„ = S and ri(iS„) < oo for all n. Given 
such a sequence {iS„}, we could attempt to define P{X) by 



n— >oo 



The difficulty is that in case (3) it is easy to see that one can get any answer one wishes 
for P{X) (including no answer at all, i.e., non-existence of the limit) by a suitable choice of 

{Sn}. 

This situation is well illustrated by the following simple example (also discussed in [T3]). 
Let N denote the set of positive integers and let Q be the measure on N that assigns to each 
subset of N the number of elements of that subset. Let X be the property of "evenness", 
so that Ax is the subset of even integers. What is the probability of an integer being even? 



If we try to directly apply (48), we obtain P{X) = oo/oo, which is ambiguous. However, 
we can obtain a well defined result by choosing a suitable "regularization procedure." For 
example, if we order the integers by size as {1,2,3,...} and let iS„ denote the integers 
appearing in the first n terms of this sequence, (49) yields P{X) = 1/2. On the other 
hand, if we ordered the integers as {1, 3, 2, 5, 7, 4, ... } and let S'^ denote the first n terms 



of this sequence, (49) yields P{X) = 1/3. This illustrates how P{X) depends on the choice 
of regularization. In this example, one could argue that the ordering {1,2,3,...} is more 
"natural" than the ordering {1, 3, 2, 5, 7, 4, . . . }, so that the "correct" value of P{X) is 1/2. 
However, the key point here is is that the determination of P{X) — if, indeed, it can be 
"determined" at all — requires more input than Q. 

The issue of the "probability of infiation" for minisuperspace models with the GHS mea- 



sure (see subsection II C above) provides an excellent illustration of this type of ambiguity. 
Using a natural regularization procedure. Gibbons and Turok [3J obtained an extremely 
small probability that the universe would have undergone ^ 1 e-foldings of infiation. 
However, using a seemingly equally natural regularization procedure Carroll and Tam |1] 
obtained a probability very close to unity that the universe would have undergone a large 
number of e-foldings of infiation. A clear explanation of the origin of this type of discrepancy 
was given long ago by Hawking and Page [2]. However, since their explanation does not 
appear to be widely known and since similar discrepancies have been noted in other, related 
contexts [22], [23], we shall provide a full discussion here. 

^ Here we assume that Q is "cr-finite" so that S can be written as a countable union of subsets of finite 
measure. 
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We consider the minisuperspace models of section II C with V{ip) = ^m'^ip^ and k = 0. 
We assume that m ^ 1 (in units where 8nG = h = c = 1), since, otherwise (see footnote [s] 
below), it will not be possible for (many e-foldings of) inflation to occur. The equations of 



motion (39) and (27) then reduce to 



= l{^' + mV') (50) 

H = -l^'. (51) 

The equation of motion for the scalar field (which can be derived from the above two 
equations) takes the form 

(f + 3H<p + mV = . (52) 



The GHS measure (46) on the surface S given hj H = reduces (up to a numerical factor) 
to 

l^GHs = a/ QH"^ — m?'^'^ a^dadip . (53) 

The allowed range of ip is 

\ipm\<V6—, (54) 
m 

but the allowed range of a{H^) is (0, oo), so the total measure of S is infinite, as previously 
noted. 

In the context of this model, we pose the following question: What is the probability, 
P{N), in the GHS measure that the universe underwent at least N ^ 1 e-foldings of inflation 
during times when H < 1 {in units where SttG = h = c = 1)7 

We can estimate the number of e-folds of inflation that will occur in these models as 
follows. During slow roll inflation, the potential energy dominates the kinetic energy, 0^ ^ 



m Lf , and we can approximate (50) and (52) by the slow roll equations 



^ \m^ip^ (55) 
6 

+ mV ^ , (56) 
from which it follows that during slow roll we have 

~ yi"^ • (57) 
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If inflation occurs, it should end when (p ^ m Lp , so the value of when inflation ends is 

^/^±y|, (58) 

i.e., inflation ends when (p gets too close to the bottom of the potential^ Let (pi denote the 
value of (f when H = 1 

'Pi = f\H=i (59) 



At H =1, the quantity !p{H = 1) = — m'^(ff may differ appreciably from the slow roll 
value (57). Nevertheless, as can be seen from flgure[l| if yjj is large enough so that inflation 



will eventually occur, then (f will approach the slow roll value before ip has a chance to 
change much, so the value of p> at which inflation begins is approximately ipi. Consequently, 
the number of e-folds of slow roll inflation is approximateljj^ given by 



V/ A=mip 1 1 (60) 

Thus, in our approximate treatment of the dynamics, for ^ 1, the points in phase space 
for which at least e-folds of inflation occur a.t H < 1 are the ones for which 

\ip,\>2y/N. (61) 

Whether or not inflation occurs in a given universe only depends on the initial value of (p. 
In particular, it does not depend on the initial value of a — as one would expect, since the 
value of a doesn't affect the dynamics when the spatial curvature vanishes. 

Although the measure of physical phase space is inflnite, the range of ip is bounded by 



(54) for any choice of if*, so the inflnity arises entirely from the unbounded range of a. 



Therefore, a very simple and natural way of "regularizing" the measure is to put in a cutoff. 



Qc, in a and then take the limit as Oc — ?■ oo. Since the measure (53) factorizes in ip and a 



® From (50), the value of H at the end of inflation is Hj w ^^m'^ip'j ^ m, so m must be very small 
compared to 1 if there is to be any chance of having many e-folds of inflation after the Planck scale, as 
claimed above. 

^ It is possible to improve this approximation. As shown in [3j, a better approximation to the ideal slow 
roll solution is iJ = + 3^) ■ Noting that Lp = —2dH/d(p, we obtain a better approximation for 

N: 

1 n 1 
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FIG. 1. The black curves in the left plot are some numerically calculated (flat FLRW-i/?) solutions, chosen so as to be evenly 
spaced in ip at the early time H = 100m, and with > at this early time (the solutions with lyi < can be obtained by 
reflection). Only the behavior of </3 (horizontal axis) as a function of H (vertical axis) is shown; the behavior of a is not shown, 
as the dynamics is independent of the value of a. A solution will undergo at least 60 e-folds of slow roll inflation if and only 
if it originates in one of the shaded regions. The right plot is a blow-up of the left plot near the origin. The solid curves are 
the solutions that have exited slow roll (i.e., they contain all the solutions drawn in the left plot, which are too close together 
at late times to distinguish on the plot) and the dashed curves are solutions that have never inflated. 

For the universes that undergo inflation, we see that ifi remains practically constant until tf reaches to the slow-roll value. 
Then the solution undergoes slow roll inflation along one of the diagonals (which are close to, but slightly offset from, the 
turning points ip = ±\/&H/'m) until H becomes small enough {Hj ~ m) and inflation ends. If we take the measure surface at 
an early time (large H/m), most of the solutions start out in the shaded region, but if we take the measure surface at a later 
time, a smaller fraction start out in the shaded region. As we discuss, this does not violate Liouville's theorem, since solutions 
are "spreading out" in the other phase space dimension, a. 



and the dynamics of ip does not depend upon a, we obtain 



P{N) = lim 

ac—^oo 



a?da 



(62) 



a^daj 1^ J \J QH^ — rn^ip^ dip 

where the (y9-integral in the numerator is taken over the values oi ip sX H = that come 
from trajectories with p > 2\/N at H = 1. Thus, we can simply cancel the integrals over a 
and — after performing the (/^-integral appearing in the denominator — we obtain the following 
well-defined formula for P{N): 

2m 



P{N) 



3ttH^ 



/ a/ 6H^ — iv?(p'^ dip . 



(63) 



By Liouville's theorem (see section |ll]), one might expect this expression to be independent 
of the choice of H^,. In fact, however, this expression is not independent of H^:. As we 
shall now show, if one evaluates P{N) at ^ m, one obtains a value very close to 1 
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(in agreement with Carroll and Tarn [1]), whereas if one evaluates P{N) at if* ~ m one 
obtains a value very close to (in agreement with Gibbons and Turok [3]). As we shall then 



explain, this strong ii^,-dependence of (63) arises from the innocuous-looking regularization 
we performed in a. 

Early-time evaluation (H^ ^ m) : As can be seen from figure [l| is nearly constant 
between H = 1 and H = H^, for trajectories that have not yet started inflation, so (63) 
reduces to 

-'(c)-Cv^r^ 



4 

TT 

2 

TT 



COS 



where C = a/ W' ^ ^ ^' 

we can make the approximation 



P{N) ^ 1 - -C . (65) 

TT 



For the choices if* = l/-\/3 and m = 3 x 10 ^ considerec by Carroll and Tam , this 
expression yields P(60) ~ 0.99996, which agrees with their calculatioij"^ 

Late-time evaluation (ii* = Hj ~ m): This choice of ii* corresponds to the end of 
inflation for trajectories that have undergone inflation (see footnote [S]) . The slow roll ap- 
proximation is no longer valid at this time, so the calculation can best be done by appealing 
to Liouville's theorem and evaluating the phase space volume of interest at ii = 1. How- 



ever, in order to apply Liouville's theorem, we must return to the regulated expression (62), 
namely 

2m f""" f I 

= a^^i^^ / "'^W J^H}-m-^^d^. (66) 

The joint integral over a and is just the GHS measure of the universes that underwent 

at least e- foldings of inflation and have a < ac when H = Hf. During the time before a 

trajectory starts inflating, the stress-energy tensor of (f can be approximated by that of a 

^'^ Carroll and Tam quote the value m — 3 x 10^"^, but we have confirmed that this was a typo. 

Note that for the given values of and to, Carroll and Tam calculate numerically that the range of 
initial values of ip which will not give at least 60 e-folds of inflation is —24 < if{H^,) < 6 (for the solutions 



with ip{H^) > 0), which differs from our estimate (61) of |(/9(-ff*)| < 2v60 ~ 15.5. This discrepancy arises 
from our approximation that ip is constant during the period before inflation; in fact, \dip/dH\ is small 
but not exactly 0, and the solutions will 'drift' in ip a.s H changes. However, this drift has negligible effect 
on the value of P{N), since all not-yet-inflating solutions drift by essentially the same amount (see figure 
jlj), and the integrand of (64) is nearly independent of (p near (p = 0. 
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perfect fluid witli equation of state p = p [2]. During this period a oc so a (X H ' , 
while (p remains practically constant, ip = ipi. Once H becomes small enough, slow roll 



inflation will begin; by (55), the value of H when inflation begins is 



Using (60), we then have 

a\rr rr ^ I r, rr ttL. „. /^/oA'^/^ 



a 



!^%^^e^if;V3^,.?/4^!!^y . (68) 
aL_i V V6 / 



H=l \H=Hs \H=l 



Thus, the condition o.\^_^ < etc is approximately equivalent to the condition o!\fj_^ < 

1 /3 

ttc (^^^ j e"*^'/"^. Thus, if we evaluate the GHS measure of the trajectories that we are 
considering at H = 1, we obtain (neglecting overall numerical factors of order unity) 



P{N) oc lim / — m^v^^ / a da 

ac^oo ma= Jo 
/•\/fi/m ^ 

OC / dip^/ 6 — m'^Lp'^ if e"'^'^ 

V&/m (69) 

Note that the factors of Oc canceled in the second line, so the expression is cutoff independent 
(and therefore there was no need to take the limit as Oc — ?■ oo after that point). The 



dependence P{N) ~ exp(— 3A^) agrees with the results of Gibbons and Turok [3] and 
yields P{N) ~ 10"^° for N ~ 60. 

Thus, the situation with regard to obtaining the probability of inflation in minisuperspace 
using the GHS measure can now be seen to be closely analogous to that of obtaining the 
probability that an integer is even in the example given at the beginning of this section. In 
the integer example, one must first "regularize" the measure by imposing a "cutoff" that 
restricts one to subsets with finitely many integers. Similarly, in the minisuperspace case. 



Gibbons and Turok obtained P{N) cx e^^^/ViV. Their additional factor of l/\/]V arises from their better 



estimation of N(ipi) (see footnote [9|) . If we had used this more accurate formula for N{ipi) in (68), our 
calculation would also yield P{N) oc e"^^ / \/N . 
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one must first "regularize" the measure by putting in a "cutoff" in phase space that restricts 
one to subsets of finite volume. In the integer example, a natural cutoff can be imposed by 
ordering the integers by size and putting in a cutoff at n. Similarly, in the minisuperspace 
case, one can order the models by their "size", a, at "time" H^, and put in a cutoff at Oc- 
The difficulty is that the subsets of phase space with a < Oc at "time" H = are very 
different in "shape" from the subsets of phase space with a < at "time" H = H'^. In 
particular, the universes with a < at an early time H^, ^ m that will undergo inflation 
will be systematically "pushed out" to much larger values of a at a late time if^, ~ m than 
universes that do not inffate. Therefore if one regularizes by imposing a cutoff in a at a late 
time, one will find that a much smaller percentage of universes will have inffated, as our 
calculations above explicitly show. 

Should one impose a cutoff in a at, say, the Planck time (see [23]) and conclude that 
inffation is highly probable? Or, should one impose a cutoff in a at a late time [21] and 
conclude that inffation is highly improbable? Or, should one impose an entirely different 
regularization scheme and perhaps draw an entirely different conclusion? Our purpose here 
is not to answer these questions but to emphasize that, even in this simple minisuperspace 
model, one needs more information than the GHS measure to obtain the probability of 
inflation. 



V. CONTRIBUTION OF INHOMOGENEOUS DEGREES OF FREEDOM 

In the previous section, we restricted our considerations to minisuperspace, i.e., the two 
dimensional submanifold of homogeneous, isotropic (FLRW) solutions. As discussed in 
section |TT| the pullback of the symplectic form to minisuperspace yields the GHS measure, 
which we used for the calculations of the previous section. However, minisuperspace is a set 
of measure zero in the full phase space. Even if we are only interested in "nearly FLRW" 
solutions, it is far from clear that the GHS measure will give a valid estimate of the phase 
space measure of the spacetimes that are "close" to a given FLRW solution. This difficulty 
is well illustrated by the following simple example (see figure [2]): Consider the plane R^, 
with measure VL = dxdy, consider the line y = with measure ^|j^_o — ^^"^ consider the 
(union of the) yellow and red shaded regions. If we choose a point in the shaded regions of 
the plane "at random" with respect to the measure f2, we will find that it is more likely that 
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the point lies in the red region than the yellow. But if we try to estimate this probability by 
choosing a point on the line "at random" with respect to the measure ^|y_o — ^-^^ '^^ ^^^^ 
get the wrong answer, i.e., we will find that it is more likely that it lies in the yellow region. 
It is quite possible that the situation for the phase space of general relativity could be quite 
similar: Specifically, the plane with measure fl = dxdy could be analogous to the full phase 



space of general relativity with the measure of subsection II B the line y = with measure 



^\y-Q ~ could be analogous to minisuperspace with the GHS measure of subsection II C 
the (union of the) yellow and red shaded regions could be analogous to the spacetimes that, 
physically, are "close" to FLRW models; and the yellow and red regions could correspond to 
different properties of interest, e.g., the yellow region could correspond to universes that do 
not infiate, and the red region could correspond to universes that do. The minisuperspace 
calculation would then erroneously predict a low probability of infiation, since the full phase 
space calculation would give a high probabihty of inflation. 




FIG. 2. A region of the plane R^. The black line y = lies more in the yellow region than in the red region, but the red 
region is larger. In the "real" measure, f! = dxdy, a randomly chosen point is more likely to lie in the red region. But in the 
restricted measure, Cl _ = dx, a randomly chosen point is more likely to lie in the yellow region. 



In this section, we will calculate the effects of including inhomogeneities on the measure 
of nearly FLRW models, thereby obtaining the modifications to the GHS measure produced 
by the type of effect illustrated in figure [2j Before presenting these calculations, however, 
we briefiy discuss three significant difficulties of principle — for which we have no satisfactory 
resolutions — regarding the inclusion of inhomogeneous degrees of freedom, even when we 
restrict consideration to nearly FLRW models. 
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A. Difficulties Arising in the Treatment of Inhomogeneous Degrees of Freedom 

The most serious difficulty arising in the treatment of inhomogeneous degrees of freedom 
is that there are infinitely many of them. As discussed in section |TT], in order to effectively 
reduce the system to a finite dimensional phase space, it is necessary to impose a short 
wavelength cutoff. There are obvious difficulties in imposing such a cutoff, since there 
is arbitrariness in the choice of cutoff, and, whatever choice is made, there undoubtedly 
will remain some coupling between the long wavelength degrees of freedom that are being 
kept, and the short wavelength degrees of freedom that are being discarded. However, in 
cosmology, there is an additional major issue of principle that arises from the fact that the 
physical wavelength of a mode is time dependent. In particular, if inflation occurs, a degree 
of freedom corresponding to a linearized perturbation of wavelength of order of the Planck 
scale at a time prior to inflation could correspond to a wavelength of order of the Hubble 
radius or larger in the present universe. If one imposes a short wavelength cutoff at a fixed 
physical scale such as the Planck scale, then the dimension of the phase space will vary 
with time, making it impossible to apply Liouville's theorem or any usual statistical physics 
arguments. On the other hand, if one puts in a cutoff at a fixed co-moving wavelength 
so that the number of degrees of freedom does not change with time, then, in order to 
consider physically relevant degrees of freedom in the present universe, it may be necessary 
to consider degrees of freedom with wavelength smaller than the Planck length in the very 
early universe. 

Another difficulty is that the notion of whether an inhomogeneous cosmology is "nearly 
FLRW" is a highly time dependent notion. Our own universe is believed to have been 
extremely homogeneous and isotropic on all scales during a much earlier phase, but is highly 
inhomogeneous on small scales now. On the other hand, most universes that are nearly 
homogeneous and isotropic at a Hubble constant corresponding to the present value, Hq, in 
our universe were extremely inhomogeneous at a much earlier phase [1] . Thus, the universes 
that are "nearly FLRW" at H = Hq comprise a very different class of universes than those 
that are "nearly FLRW" at H :$> Hq. 

Finally, even if we select a particular value, H = H^, to impose a short wavelength cutoff 
and impose the requirement that inhomogeneities be "small," it is not obvious how to define 
"smallness." We will make such a choice in our calculations below, but one could argue 
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that other choices are equally "natural," and it is clear that the results will depend in a 
significant way upon the definition of "smallness." 

B. The Symplectic Form for Perturbations of FLRW 



In section II C we evaluated the symplectic form of general relativity, 5ix, ^2x1^ when 



6iX and 62X are homogeneous perturbations off an FLRW background X] the result of this 



calculation is given by (38). In this subsection, we will evaluate uj[x', Six, S2X] when 6ix and 
62X are arbitrary (inhomogeneous) perturbations off of an FLRW background. This will 
enable us to obtain the Liouville measure on the space of "nearly-FLRW" universes, which 
we shall do in the next subsection. 

We will mostly follow the notation of Mukhanov et al. [25J (although we use the opposite 
metric signature). In conformal time, 

(™) 

the metric of an FLRW universe takes the form 

ds"^ = a^irj) {—drf + jijdx'^dx^) , (71) 
and the Friedmann equations take the form 

H' + n = l(^l{v',r + a'V{vo?j (72) 

^ + '^ = -^^K)'-2aV(v.o)) , (73) 
where ' denotes differentiation with respect to 1], and 

H =- = d = aH . (74) 

a 

We write, 

= V,r/ , (75) 

and we define Da to be the derivative operator associated with 'jab- 

As is well known, there is a natural way to decompose any perturbation of an FLRW 
solution into its scalar, vector, and tensor parts [25], |26] , 

Sgab = (<^5ab)scalar + (-^^afe) sector + i^9ab)tensor ' (^6) 
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where 

(^^?a6)scalar = 2 a' {-<PUb + ^iaDb)B - ^JJab + D,D,E) (77) 

i^9ab\e.,o. = {i(aU,) + Z^(.H)) (78) 

('^^a6)te„sor = • (79) 

Here [/" and are tangent to the spatial shces and are divergence free {DaU"" = DaV"^ = 0), 
whereas hab is also tangent to the spatial slices, traceless (/i"„ = 0), and divergence free 
{D"'hab = 0). In addition to the metric, we must also perturb the scalar field ip, 

ip{r],x') = ^o{v) + Sviv,x'), (80) 

and 6ip is classified as belonging to the scalar part of the perturbation. 

It is well known that the different perturbation types are decoupled in the linearized limit, 
i.e. the linearized Einstein equations break up into independent scalar, vector, and tensor 



pieces. The symplectic form, given by (23), is quadratic in the perturbations, but it is not 
difficult to see that the "cross terms" between the different perturbation types integrate to 
zero, so we have 

<^ = l^scalar + l^vcctor + l^t ensor ) 

(81) 

where cjscaiar only contains terms quadratic in scalar perturbations, ^vector only contains 
terms quadratic in vector perturbations, and extensor only contains terms quadratic in tensor 
perturbations. 

We will now calculate the scalar, vector, and tensor perturbation contributions to the 
symplectic form, u, and its pullback, u, to the constraint submanifold. 



1. Scalar Perturbations 

Scalar perturbations naturally decompose into a "homogeneous part" (corresponding to 
perturbations towards other FLRW models) and an "inhomogeneous part" (for which the 
spatial integrals of 0, and 6ip vanish). The computation of co[x',Six,S2x], when 6ix and 
62X are homogeneous perturbations was already done in section |II C There are no cross 



terms in u between homogeneous and inhomogeneous perturbations, so it remains only to 
calculate u[x] SiX, ^2X1^ when 6ix and 62X are inhomogeneous perturbations. 
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From (77) and (80), we see that there are 5 scalar functions, {(f>,ilj, B, E,Sip), defining 
an inhomogeneous scalar perturbation. However, there is gauge freedom to eliminate two 
of these functions via 6gab Sgab + V(aff,) with Va = a^a + Daf3. Thus, there are three 
independent gauge-invariant variables p6], and we will use the choice of variables given in 



$ = 0+-[a (B-E')]' 
a 

^ = -(5-^0 
a 

a = 5^ + ^'^{B - E') . 
In terms of these variables, the scalar linearized Einstein's equations are [25] 



(82) 



1 



^'^ {a' - ^'^^) + a' 



dV 

dip 



a 



^0 



1 



^" + if ($' + 2^') +[2H' + H'^j $ - + -/^^(^ - ^) 



dV 

dip 



a 



(83) 
(84) 

(85) 



where spatial indices are raised by 7*-^, and D = D^Di. Equation (85) implies that 



(86) 



Plugging an FLRW solution and two inhomogeneous perturbations of the form (77, 80) 



into the formula, (23), for w, and using (82) and (86), we obtain 



'^scalar 
inhom 



-2a2 ^ d^x^ |3$;<l>2 + \ [Vo$i(72 - a[a2] | - (1 O 2) 
+ (terms that vanish when the Unearized Einstein equations are satisfied) . 



(87) 



Here we have integrated by parts and dropped surface terms, since we are only considering 
universes with compact spatial slices. We have not explicitly written out the terms that 
vanish when the linearized Einstein equations are satisfied, since for the calculation of the 
Liouville measure, we will pull back the symplectic form to the constraint submanifold, and 
these terms will vanish. Note that, as we discussed in section [IT], the symplectic form should 
have "pure gauge" variations as its degeneracies when pulled back to the constraint surface; 
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this can be seen explicitly in 87 since, when the linearized Einstein equations are satisfied, 
the non- vanishing terms have been written exclusively in terms of gauge invariant variables. 



Using (83), (84), and (86) to eliminate cr and a' in terms of $ and we obtain 



scalar 
inhom 



^ d\^{^', {D' + 3k) $2} - (1 ^ 2) 



+ (terms that vanish when the Unearized Einstein equations are satisfied) . 

We can further rewrite this expression by eliminating $' in terms of the gauge invariant 
density contrast [1] 



where 



5p + p',{B-E') 



p = _lvVVa<^ + \/(<^) 



Using (83), we find that when the linearized Einstein equations hold, we have 

2 



(89) 
(90) 

(91) 



We thereby obtain 
2apo 



'^scalar 



% [ d^x^\6i [D^ + 3k) $2) -(1^2) 



inhom 3H(p^ 

+ (terms that vanish when the Unearized Einstein equations are satisfied) 



(92) 



where we have used (70) and (74) to switch from conformal time to proper time. 



We can expand a general inhomogeneous perturbation in eigenfunctions of the Laplacian 
(which has a discrete spectrum since our spatial slices are compact). Let /(„)(x*) for 
n = 0, 1, 2, ... satisfy 

D^f{n) = -klf in) (93) 
with ko = and fc„ nondecreasing in n, and 



(94) 



where we remind the reader that we have normalized jij so that d^x^ = 1. We expand 

00 

n=l 

00 



(95) 



n=l 
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Substituting these expansions into (92), pulling the expression back to the constraint sub- 



manifold, and adding in the homogeneous contribution cJghs) we obtain 



'^scalar - '^ghs + ^ ^scLr ' (^6) 
n=l 



where 

^ 2apo (g - 3/.) („) („) ^ a (g^ + /.a-^) (fc^ - 3/.) („) („) _ 

scalar g^^g (3if2 _ l/(<^o) + S/ta^^) V > 

Finally, we comment that an irrotational fluid with equation of state P = wp can be 
treated within our formalism by setting = and replacing X = {—^V^ipVaf) with 



in the Lagrangian (|13|), with i = ^y^. Carrying out the analogous calculations in the case 



K = 0, we obtain 



This agrees with the expression obtained by Carroll and Tam [4J for the case p = p/3. 



2. Vector Perturbations 



From (78), we see that there are two divergence-free spatial vectors, Ui and l^, defining 
a vector perturbation. However, there is gauge freedom to eliminate one of these vector 
fields via 5gab — > 5gab + V(af b) with f „ divergence free and tangent to the spatial slices. The 
combination 

V, = U,- VI (99) 



is gauge invariant [27]. One can calculate that the only term appearing in the integrand of 
i^vector is proportioual to [(V^i)*-D^(V^2)i ~ (1 ^ 2)] . However, the constraint equation is 



L>Vi = 0, (100) 

where the right hand side is zero because the stress energy tensor has no vector part for 
scalar field matter. Thus, we have 

C^vector = . (101) 
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3. Tensor Perturbations 



The tensor perturbation (79) involves a symmetric, traceless, and divergence-free spatial 
tensor field, hij. There are no constraints on hij or its time derivative arising from Einstein's 
equation, and is gauge invariant. Plugging two tensor perturbations (79) into the formula 



(23) for the symplectic form, we obtain 



^tensor = —^a' J d'x^ [^^''^'^^^ ~ ^ ^^J " ^^^^^ 

Let {f(n)\j for n = 1,2, ... denote an orthonormal basis of divergence free and traceless 
eigentensors of -D^, i.e., 

^'(/(n))., = -^n(/w),, , (103) 

with kn nondecreasing in n, and 



d'x^{f^^))^^{f^m)T =5n,m (104) 

(we remind the reader that spatial indices are raised by 7*-^). We expand 

00 

^.. = E^^"^(^)(An))., • (105) 
n=l 

(Note that we are including both polarizations in the sum.) We obtain 

00 _ 00 

n=l n=l 

The formula for extensor is identical, as there are no tensor constraints. 



C. Modifying the GHS Measure by Including Inhomogeneities, and the Effect on 
the Probability of Inflation 

As discussed is section [TT], in order to obtain a finite dimensional phase space, we impose 
a short wavelength cutoff. We do so by discarding all scalar modes that have n > A/i and 
all tensor modes that have n > JV2. (Note that if we put the cutoffs at approximately the 
same values of wavelength, i.e., if fc^^ ~ kjj-^, then we will have A/2 ~ 2A/i, since there are 



two tensor polarizations.) Imposing these cutoffs and combining (96) with (106), we obtain 
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the following expression for the symplectic form for perturbations of FLRW, pulled back to 
the constraint surface: 



uj = cJ^,, + V {kl - 3k) ci<l>(") A + 7 ^^^"^ A . (107) 

As discussed in section [IT], to obtain a nondegenerate symplectic form, we further restrict 
to the surface S defined by i^' = Sif*, i.e., we impose if = if* on the background FLRW 



spacetime and SK = on the perturbations. In addition, as mentioned in II B , we must fix 
the spatial coordinates as well. However, our variables describing inhomogeneous perturba- 
tions have been written in a gauge invariant form, so there is no change in their functional 
form when we restrict to 6K = and fix spatial coordinates. Furthermore, homogeneous 
spatial diffeomorphisms act trivially on the homogeneous perturbations. Thus, we obtain 

a; L = \ * ^ i _ \(l^n^da 

ii*(3ii2-y(^o) + 3«:a-2) " > ^ ' 

. Ml 

4 ^ 

n=l 



where we have used the explicit form of the GHS measure, (46), derived in section II C 



To obtain the Liouville measure on our truncated, 2(1 + Afi + A/2)-dimensional phase 
space, we take the top exterior product of a;|^, thereby obtaining 

n = ^ a(i+m+a/-2) I 



3a2 {6H^ - mV + 4Ka 
^/6H^ — m?ip'^ + 6Ka~^ 



dip A da 



A 



A 



lrr2^ -2X \ M (109) 



'1 \ A/'2 A/2 

/ r, — n 



71=1 



n=0 

This expression for the 2(1 + A/i + A/2)-form Vt is, of course, valid only when evaluated 
at exact FLRW models. However, if we restrict consideration to solutions that are suf- 
ficiently close to FLRW models, we should be able to neglect the dependence of VL on 
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(<l'(^\5(i\...,$(-^i\5(-^i),/i(i)/i(^\...,/i(-^2)/^(A/-2)) and continue to use (|T09| to calculate 
phase space volumes of regions sufficiently near FLRW models. 

We now wish to integrate Q over a range of perturbation variables corresponding to "small 
inhomogeneities," so as to obtain the "true" measure of nearly FLRW spacetimes (see figure 
|2] and the associated discussion at the beginning of this section). To do so, we need to 
decide how to define "small inhomogeneities", i.e., how to choose which universes are "nearly 
FLRW". Following Carroll and Tam [3], we will choose the variables $ and 6 to indicate the 
"size" of scalar perturbations, since |$| ^ 1 means that the metric perturbation is small 
compared to the background metric, and \6\ <^ 1 means that the density perturbation is small 
compared to the background density. For the tensor perturbations, we will choose h and h/H 
as indicators of "size," since |/;,| ^ 1 means the metric perturbation is small, and \h\ H 
means that the perturbation of the extrinsic curvature is small compared to the background. 
Thus, we provisionally define "nearly FLRW" universes at "time" to be those that have 
\^{H = H,)\ < e, \6{H = H,)\ < e, \h{H = H,)\ < e, and \h{H = H,)\ < H,e for some 
e > 0. (Here, for simplicity, we have taken e to be independent of n and the same for scalar 
and tensor modes.) Then, the modified GHS measure on minisuperspace, which takes the 
"thickening" effects of inhomogeneities into account, is given by 



|${'l)|<£ 

|<5(")|<e 

l'^'">l<^ (110) 

4e^a{H! + .a-^) ■ ( f\ (k^ - 3.)] ■ (e^a^Hy^ ■ n,^, , 

H,{3H^-Viifo) + 3Ka-^)) l^A A/ " ^ ^ *^ 

where fleas is giveiij^ by equation (46). For the case of k = and V{(f) = ^m'^if'^, and 

choosing A/'2 = 2A/i = 2Af, we obtain 



Note that ^ghs varies with a as a^+^-^, so including more perturbation modes makes the 
large-a divergence more severe. In particular, including the effect of perturbations on the 
measure does not alleviate the difficulties discussed in section [IV] associated with the infinite 
total measure of minisuperspace. 

Note that 51ghs is conserved under dynamical evolution in minisuperspace, but fleas is not because the 
"boxes" of size e in our perturbation variables do not maintain their size under evolution. 
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It is easily seen that fi^Hs has a non-integrable divergence (for Af > 2) as (fo ^ ±y/6H^,/m, 



which hes at the boundary of the range of <^o allowed by the Hamiltonian constraint (50), 
and corresponds to 0o ~^ 0- The origin of this divergence can be traced to the fact that ^^"'^ 
and 5*-"^ become linearly dependent at (po = ±\/6H^:/m. Thus, as (po — j- ±\/6H^:/m, a box 
of "size" e in and 5*^"^ allows an unboundedly large range of other variables such as 

^"^f 'I^^W + n 12) 



where a was defined by (82). We will therefore supplement our definition of "nearly FLRW" 
universes by imposing the additional requirement that |(t'^"''| <^ ^Iv^ol when LpQ is close 
to zero. This eliminates the divergence at (^q — ?■ ±^/6H^:/m that occurs in (111) without 
affecting our formula for the measure when (pQ is not close to ±\/6H^:/m. 



Let us now compute the probability of inflation in parallel with the analysis of section IV 



but using the measure ^ghs rather than ^ghs- First, we perform an "early-time evaluation" 
{H^ ^ m) of the probability of inflation. For fixed a, the GHS measure is peaked at y^o = 
and decreases to zero as (po approaches the boundary of its allowed range, so the GHS 
measure is peaked on universes that have the least amount of inflation. By contrast, ^ghs is 
more peaked near the boundary of the allowed range of (po, where the most inflation occurs. 
Thus, the probability of inflation computed using ^ghs at early times will be even higher 



than the (extremely high) value obtained in section IV using ^^ghs- 

On the other hand, suppose we perform a late-time evaluation (ff^, ~ m) of the probability 
of inflation. In this case, the main effect of including the perturbation modes comes from 
the change in the a dependence of the measure from ^ghs oc to ^ghs oc a^"*"^"^. Carrying 



out the analysis in parallel with section |IV[ we obtain 

P{N) ~ e-(3+7A^)^ , (113) 

which is much smalleim than the (extremely small) probability of inflation calculated in 



section IV using fic 



As in section IV, our purpose here is not to advocate for performing the calculations at 
early times or at late times, nor to advocate for our particular choice of definition of "nearly 
FLRW" spacetimes. Rather, our purpose is merely to illustrate in a concrete manner the 



Indeed, if the "box size," e, is not extremely small, one may argue that the probability of inflation should 
be even further reduced because some of the universes that are deemed to have undergone inflation in 
this calculation will not have done so because the spatial inhomogeneities were too large. 
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additional issues that arise — and the additional choices that need to be made — when one 
takes the inhomogeneous degrees of freedom into account. 

VI. IMPOSSIBILITY OF RETRODICTION 

In the previous sections, we have highlighted difficulties that arise when one attempts to 
use the natural Liouville measure of general relativity to calculate probabilities in cosmology, 
such as the probability of inflation. Some of the difficulties are a direct consequence of the 
fact that the measure of phase space is infinite. A possible approach to dealing with this 
problem is to use our knowledge of (or beliefs about) the present universe to reduce the 
relevant region of phase space to one that has finite volume. In particular, we know/believe 
that the present universe is nearly an FLRW model. We do not know the present value 
of the scalar factor, a, of this background FLRW model, but we can simply assume that 
it takes some particular value oq. Suppose we restrict consideration to the region. A, of 
phase space corresponding to universes that differ by a suitably small (but finite) amount 
from this background FLRW model at the time when the trace of the extrinsic curvature 
is equal to the present value, Hq, of the Hubble constant. If we let B denote the subset of 
universes that undergo suitably many e-foldings of inflation, then one might argue that — 
taking account of our knowledge of the present state of the universe — the probability that 
our universe underwent an era of inflation should be given by 

where Q is the canonical measure of general relativity, and the right side is now well deflned, 
since A and B (1 A have flnite measure. Indeed, this formula essentially corresponds to the 
calculation of the previous subsection taking = Hq\ a, calculation of this nature was also 
carried out in We refer to this type of calculation as a "retrodiction" because it uses 
present conditions as an input to determine the likelihood of past conditions. The purpose 
of this section is to argue that, in a universe in which the second law of thermodynamics is 
valid, this type of calculation is essentially guaranteed to give the wrong answer, i.e., it will 
assign an extremely low probability to what actually did happen in the past. This "futility 
of retrodiction" has been discussed in depth by Eckhardt [28], and we refer the reader to 
that reference for additional discussion. 
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In ordinary statistical mechanics, we can use knowledge about the present macrostate of 
a system to successfully predict the hkely future evolution of the system. This is done by 
considering all of the possible microstates that are consistent with the present macrostate 
and evolving them forward in time using the (microscopic) dynamical equations. If it is 
found that an overwhelming majority (in the Liouville measure) of these microstates will 
have some given property at some specified time in the future, then we can be confident 
in predicting that the physical system will have that property at that specified time. We 
refer to this type of argument as a "trajectory counting argument" . The key point is that 
although one can use a trajectory counting argument to predict the future, one cannot use 
a trajectory counting argument to "retrodict" the past. The time reversal invariance of 
the microscopic dynamics assures us that most trajectories through phase space that pass 
through a present non-equilibrium macrostate have entropy that is increasing both into 
the future and into the past. However, it appears that we live in a universe in which the 
second law of thermodynamics holds, where entropy increases only in one time direction 
("the future"). In practice, we find that trajectory counting arguments do give correct 
results when used to make predictions; i.e., systems do not appear to be evolving towards 
some "special final state." However, because entropy in our universe decreases towards the 
past, trajectory counting arguments will always give incorrect answers when used to make 
retrodictions. 

A simple example will illustrate this point. Suppose that at 12:00 there is a cup of water 
with a temperature slightly above freezing and with a small ice cube floating in it. It is 
known that the cup was isolated since at least 11:59, and it will remain isolated until at 
least 12:01. Was there a larger ice cube at 11:59? A trajectory counting argument for this 
case would consider all possible histories that have a small ice cube in the cup at 12:00 and 
calculate the fraction (in the Liouville measure) of those histories for which there is a larger 
ice cube in the cup at 11:59. Very few trajectories have this property; the overwhelming 
majority have a smaller ice cube (or only liquid water) at 11:59. Indeed, since this system is 
time reversal invariant, the trajectory counting argument will give the same answer whether 
we go forward or backward in time. The trajectory counting argument will correctly predict 
that there will be less ice by 12:01. It will incorrectly retrodict that there was less ice at 
11:59. 

The same behavior applies in cosmology. If, as above, we let A be the set of universes 
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that can be described as "nearly-FLRW today," then most universes in A were extremely 
inhomogeneous at early times [1]. Indeed, if we consider almost any universe that looks 
like ours now and reverse-evolve it into the past, we would find that inhomogeneities would 
generically grow (into the past) — so much that they could no longer be described by the 
linearized theory. Infiation (or, "defiation," since we are evolving backwards) would surely 
not occur at early times. In other words, in the set of all possible histories that are consistent 
with our current conditions, the most numerous type of history (in the Liouville measure) 
is one in which the universe was highly inhomogeneous in the past — with many regions of 
"delayed big bangs," i.e., "white holes" — and just managed to smooth itself out enough by 
our present time so that it could be approximately described today as a linear perturbation 
of FLRW. 

As we have just argued, in a universe in which the second law of thermodynamics holds, 
we cannot test a hypothesis about the past by retrodiction, i.e., we cannot test it by deter- 
mining whether the hypothesized occurrence is generic in the space off all possible histories 
consistent with present observation. Rather, to ascertain what happened in the past, we 
make hypotheses about the past based on considerations such as simplicity (a la "Occam's 
Razor") and mathematical elegance. Once we have made such a hypothesis, we can test 
it using prediction rather than retrodiction. If a simple and/or elegant hypothesis about 
the past makes predictions — via trajectory counting arguments — that are in agreement with 
observation, this is taken as evidence in favor of the hypothesis. The simpler and more 
elegant the hypothesis and the more it accounts for, the stronger we consider the evidence 
in its favor. In a universe in which the second law of thermodynamics is obeyed, this is 
essentially the only way to get information about the past state. For example, we deduce 
that light element nucleosynthesis must have occurred in the early universe not by taking 
the nuclear abundances of the present universe and retrodicting past de-synthesis, but by 
making very simple hypotheses about the state of the early universe and predicting the 
abundances we see today. The evidence for light element nucleosynthesis is strong because 
much is explained from a simple hypothesis. 

In a similar manner, we assert that if one wishes to make an argument in favor of infiation 
having occurred in the early universe, this argument must be based upon its being a simple 
and/or elegant hypothesis that accounts for observed phenomena. Any argument about the 
"likelihood" of inflation based upon the Liouville (or other) measure on phase space will 
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require a justification for tlie use of this measure (see section III) as well as a justification 



for the regularizations used to obtain probabilities from this measure (see sections IV and 



Vj). Furthermore, as we have argued in this section, we cannot obtain any useful information 
about whether inflation occurred in the early universe by determining whether it is generic 
in the space of all universes that look like ours today. 
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